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Phenotype-switching with and without sensing environment is a ubiquitous strategy of organisms 
to survive in fluctuating environment. Fitness of a population of organisms with phenotype-switching 
may be constrained and restricted by hidden relations as the entropy production in a thermal system 
with and without sensing and feedback is well-characterized via fluctuation relations (FRs) . In this 
work, we derive such FRs of fitness together with an underlying information-theoretic structure in 
selection. By using path-integral formulation of a multi-phenotype population dynamics, we clarify 
that the optimal switching strategy is characterized as a consistency condition for time-forward and 
backward path probabilities. Within the formulation, the selection is regarded as passive information 
compression, and the loss of fitness from the optimal strategy is shown to satisfy various FRs that 
constrain the average and fluctuation of the loss. These results are naturally extended to the 
situation that organisms can use an environmental signal by actively sensing the environment. FRs 
of fitness gain by sensing are derived in which the multivariate mutual information among the 
phenotype, the environment and the signal plays the role to quantify the relevant information in 
the signal for fitness gain. 
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Phenotype-switching is a strategy of living systems to 
survive in stochastically changing environment)^. Even 
if no environmental information is available, diversifica¬ 
tion of phenotypes by stochastic switching (known as 
bethedging) can lead to gain of fitness when a subpopu¬ 
lation with a resistant phenotype can survive in a harsh 
environment to feed the next generation|^Q. If environ¬ 
mental signal that conveys information of the environ¬ 
ment is exploitable, further gain of fitness is possible by 
switching into the phenotypes adapted to the future en¬ 
vironment (known as decision-making) |^-|^ . Ubiquitous 
observations of phenotype-switching and environmental 
sensing in living systems from higher organisms down to 
bacteria implies its actual fitness advantage over the di¬ 
versification loss and the metabolic load of switching and 
sensing mechanisms |]8|-|TT| . 

The fitness gain enjoyed by switching and sensing, 
however, must be constrained by the environmental 
statistics and the sensed information. On the one hand, 
previous investigations clarified such constrains for aver¬ 
age fitness gain at least in specific situations [|[ ^ 0, [l^ 
p^ . On the other hand, the similarity between evolution¬ 
ary dynamics and statistical physics suggests that 

more general relations may exist as the series of fluctua¬ 
tion relations (FRs) characterize not only the average but 
also the fluctuation of entropy production in a thermal 
system with and without sensing and feedback)^ p0| . 
Finding such relations is crucial to understand the con¬ 
straints and predicability of adaptive dynamics of or gan - 
isms in ever changing environment (fitness seascapes) pi| . 
In this work, by using a path-wise (path-integral) formu- 
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lation of the dynamics of growing population with multi¬ 
phenotypes we reveal such relations of fitness to¬ 

gether with the underlying information-theoretic struc¬ 
ture of selection. 

Let Xt € Sx and yt £ be a phenotype of a living 
organism and a state of environment at time t, respec¬ 
tively. For simplicity, possible phenotypic and environ¬ 
mental states are assume to be discrete. We also define 
paths (histories) of phenotype and environment up to 
time t as Xt := {xr\T £ [0,t]} and % ■= {VtIt £ [0,t]}, 
respectively. The population size of organisms in a phe¬ 
notype X at time t under a realization of an environmen¬ 
tal path % is denoted as (x). When the population 
size is sufficiently large for all x, Mf {x) can be approx¬ 
imated to be continuous as in |^, Phenotype of an 
organism, in general, switches stochastically over time 
depending on its state. The switching dynamics is mod¬ 
eled, for example, by a Markov transition probability T 
where T(a::'|x) = 1 as in [^. In addition, an organ¬ 
ism with a phenotype x under an environmental state y is 
assumed to duplicate asexually to produce its — I 

copies on average within the unit time interval where 
h : iSj, X iSy —> M. Then, the time-discrete dynamics of 
the population size, Nt {x), can be described (Fig. ||) as 

W4i(x') = ^T(a:»A4^(a:). (1) 

X 

Cumulative fitness of the population at t under an envi¬ 
ronmental path % is defined as := In 

the environmental path follows a path probability Q[9/], 
we can define the environmental ensemble average of 
the cumulative fitness as ('Ft) := ('I'[9^])Qjy.^j. More¬ 
over, with additional assumptions on % and T, the time- 
average of the cumulative fitness, := (iFf) /t can 
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FIG. 1. A schematic diagram of the population dynamics 
with phenotype switching!^]. Note that the actual model 
accounts for snfficiently large population, and the lineage of 
cells in the hgure illustrates behavior of only a small subset 
of the population. 


also reflect temporal averaging of long-term growth under 
one realization of the environment as limt_>.oo = 

limt_>.oo this work, finite t is considered. 

All formulations and definitions can be naturally ex¬ 
tended for the situation where an environmental signal 
Zt € Sz is available. Let Zt := {z{t)\t G [0,t]} be 
the path of the signal, and Q[f)^,Zf] be the joint paht- 
probability of the environment and the signal. The pop¬ 
ulation dynamics with the signal can be obtained by sim¬ 
ply replacing T(x'|a:) with 'f{x'\x,z') as 


where ©[PUPf] := i® the Kullback- 

Leibler divergence (KLD) between P and Pf [p^-|2^. This 
variational problem can be attained by a retrospective 
(time-backward) path probability p^ , defined as 

Vl\Xt] = (5) 

With this backward path probability, we have 

n%] = {m[Xu%])r^^-V[¥l\\^F]. ( 6 ) 

This path-wise formulation generally holds for more gen¬ 
eral Pf that may not be generated by Markov processes. 
In the following, therefore, we consider that PF[At] can 
be any path probabilities over the phenotypic history, Xt . 

The forward and backward path probabilities, P_F[At] 
and Pg[At], have an obvious interpretation. P_F[At] is the 
probability to observe an organism with a phenotypic his¬ 
tory Xt when we randomly sample an organism from the 
initial population at t = 0 and track it in a time-forward 
manner (Fig. p. When the tracked organism duplicates, 
we choose one of the two daughters randomly. PB[Ai], in 
contrast, is the probability to observe Xt when we ran¬ 
domly sample an organism from the hnal population at 
t and track it back retrospectively (Fig. |^). Because the 
backward path probability is defined for a hxed environ¬ 
mental history, we can also define a joint path proba¬ 
bility as Pg[At,9^] := P^[At]Q[9t]. We can also obtain a 
marginal path probabilities as Fg[Xt] 

With these probabilities, we have the average of the 
cumulative fitness as 


(x') = ^ T(x'|x, Zt+iW^’^{x). (2) 

X 


We also define the cumulative fitness and its av¬ 
erage as '^[%,Zt] := 


and (^'t) := 


As indicated in [^, the total population 

size at time t, Af^ := ■^t' i^')^ can be described 

with a path integral formulation. Let define a time- 
forward path probability of phenotype without sensing 
as FpiXt] := n^“oT(xr-Hi|xi-)PF(a;o) where PF(a;o) := 
J^o{xq)/Mq. In addition, a path-wise (historical) fit¬ 
ness of a phenotypic path Xt under an environmental 
path % is defined as B.[Xt,%] := YlV=oK^-r+i,yT+i)- 
Then, the population size of organisms that experi¬ 
ence the phenotypic path Xt under 9t at time t is 
M[Xu7t] = e®‘[^*'^*lPF[At]Wo||, H. Thus, we have 
A/f = A/'[Ai, 9t] and 


Because of this representation, '^^1%] can also be repre¬ 
sented variationally as 


{^t) = - V[F%^\\Fb], (7) 


where is a backward mutual informa¬ 
tion between Xt and 9t defined as := 

T.X ,, yt] In 13 ■ This relation gen¬ 

erally holds for any strategy of phenotypic switching. 
Among them, we focus on a special strategy, Pf)^!], 
that maximizes (di't). Because {"^t) is concave with 
respect to Pf, PF[At] is unique in the convex space 
of path probabilities. However PF[At] may not exists 
within biologically realistic class of path probability, 
e.g., ones generated by Markov or causal processes. If 
exits, this strategy can be regarded as the strategy that 
have adapted evolutionary to the environment defined 
by Q[9t]. Even if not, PF[At] and the corresponding 

:= maxpp (IF) still plays important roles as the 
bound of fitness in the FRs derived in the following. Such 
optimal strategy must satisfy the stationary condition, 
5 ('Ft) = 0, for any perturbation of the strategy (5 Pf 


around Pf. The condition can be explicitly represented 

\ _ / ip^w \ 

PjrfjCt] /^Pp \ IPpHt] /aPp 


as (5(d't) = (E 


From this equation, we obtain a consistency condition 
between the forward and backward probabilities as 


^'[Pt] = max[(]HI[At,9t])p - T>[P||Pp^]], 

P[Vt] 


% 


( 4 ) 


( 8 ) 
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The consistency condition requires no time-directionality 
in the phenotypic paths in the following sense. When we 
sample phenotypic paths in the time-forward manner, we 
have the ensemble of paths, that contains no in¬ 

fluence from the environment. When we sample paths in 
the time-backward manner without observing the envi¬ 
ronment, 9t, we have another ensemble of paths that fol¬ 
lows the marginal backward path probability [Xt] (Fig. 
^ (A)). While the forward and the backward path prob¬ 
abilities are the same marginally under the consistency 
condition, the selection induces correlation between the 
backward phenotypic dynamics and the environmental 
history. This fact is quantitatively described by the op¬ 
timal cumulative fitness obtained from eq. (|^ as 



FIG. 2. (A) A schematic diagram of the forward, backward 
and the marginal path-ensembles, P_F[Ai], and [Xt]. 

(B) Venn’s diagram for information among phenotype, envi¬ 
ronment, and signal. The region circled in bold black is 
and those in green and red foreground are and 

respectively. 


(Tt) = (H[At,irt])p.(9) 

where measures the correlation. This form of the 
optimal cumulative fitness can be further represented as 
another type of variational problem]^ as 


Furthermore, we also have the Kawai-Parrondo- 
Broeck(KPB)-type FR as 

(AT-t) = 2 ?[]Pf||Pf] - {V[PX])^ = 2^[QII (Pb)pJ- 

(13) 


'!'()= max 
PllXt] L 


(]HI[At, 9t])py [Xt]Q[yt] 


■x,y 


( 10 ) 


From the information-theoretic viewpoint, this is equiva¬ 
lent to the lossy soft compression or encoding of the envi¬ 
ronmental history, 9t, into the phenotypic history, Xt , un¬ 
der a utility measure MI (or equivalently distortion mea¬ 
sure —H) 1^ . The environmental history is composed of 
information relevant to and nothing to do with increase 
of the path-wise fitness H. The variational form of 
indicates that only information relevant for increasing H 
is imprinted or encoded into the phenotype history, and 
the optimal is regarded as the optimal encoder. 

This relation clarifies that selection can be regarded as a 
kind of passive information processing, and the backward 
mutual information, quantifies the information en¬ 

coded by the selection. 

From the consistency condition (eq. (||)), we have 
F%[Xt]q[!rt]/FB[Xt] ^ = Er, = 

1. This implies that PfID*] := P'^[Xt,7i]/F^[Xt] = 
holds for Xt G Supp[Pf] := 
{Af|PF[Ai] 7 ^ 0}. By taking average with any PfIa*] 
sharing the same support with Pf, we can easily see 
that the fitness loss of a suboptimal strategy defined as 
A\E'[9t] := ^1%] — ^1%] satisfies the following detailed 
FR: 


Pf [At] ^ 
PF[At]P^[At] 




PFty] 




( 11 ) 


The second term shows that the average loss of a sub- 
optimal strategy, (Aik), is determined by the strength 
of contraction of the phenotypic path probabilities from 
PIPfIIPf] to /ii[Pp||Ph]\ that is induced by selec- 

\ / Q[y]^ 

tion. The third term, in addition, shows that the loss 
is zero when equals to the statistics of en¬ 
vironment, In addition, this FRs can be used to 

quantify the loss by causal strategy even when Pf is not 
causa i[|g. 

All the result above, i.e., the consistency condition, 
the maximal cumulative fitness, and the FRs, can be 
generalized for the situation where the environmen¬ 
tal signal, Zt, is available. Let us define the for¬ 
ward path probability with the signal as Pp[At] := 

nt=o'^(*T--i-i|®T,-Zr-i-i)PF(®o) for T{x'\x,z). With this 
forward path probability, we similarly have the cumula¬ 
tive fitness with the signal, 'I'[9^, Zt] = In 

and the backward path probability, Pp ^[At] = 

gHty,yt]-'i'[ytZt]pz gg^gg tye signal, 

we consider general Pp than Markov or causal ones. The 
joint and marginal backward probabilities are also de¬ 
fined as P^[At,frt,2t] := P^’^[At]Q[frt,Zf], Pf [At,Zj] := 
Ey,PfAt,%,2,], Pf[At,%] := and 

Pf[At] := Eyt.ZtConditional probabili¬ 
ties are F%[Xu%] := Pf A*, fTt, At]/Q[At], P^[At,2t] := 
P^[At,ri,A*]/Q[9j], Pf’^[At] := Pf [At,At]/Q[A*], and 
Pf’^[At] := pf [At, 9t]/Q[f>t]. With these extensions, the 
average cumulative fitness, (T't) := {’^[‘Jt, 


An integral FR immediately follows as 
\ ! Q[Xt] 


(vkt) = (H[At,%])p. - (P[Pf’^[At]||P^[At]]) , 

B \ / Q[Zt] 

(12) where := (p[P|[At,%]||PB’^[At]Q[frt|At]])^^^^. 
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We also have the optimal strategy with the signal as 
Fp := argmaxpz ('I't) where we use “ to indicate the op¬ 
timal strategy with the signal to distinguish it from one 
without the signal. The optimal strategy satisfies the 
following extended consistency condition as 


= J2^F^B^[Xt]Q[%\Zt] = H[Xt], (14) 


and the corresponding maximal cumulative fitness is 


(^*) = (H[x*,%])p. (15) 

Similarly to the case without the signal, we can interpret 
this relation as an information compression of 9t to Xt 
with side information 

By using Pb’^[%] := ^/F^[Xt,Zt] 

derived from the extended 
consistency condition, we similarly have the fitness loss, 
A-^[%,Zt] := T[frt,2t] - by a suboptimal 

strategy with signal, Pf. as 

^ (p^-^[9i]) /q[%\zt\, (16) 

\ / rp[A:tJ 


When Ppi^t] shares the same support with ]PB[Xt] , by 
choosing the optimal strategy without signal as PB[Xt] = 
PF[X],the Sagawa-Ueda detailed FR [Q as 

where := Q[%, 2t]/Q[rt]Q[2*]. The KPB-type 

FR, = D[P^||]P^Pf] > 0, shows that 

jy.z 

is an upper bound of the average gain of fitness 
by sensing. Nonetheless, does not always properly 
quantify the gain of fitness by sensing. For example, if all 
phenotypes have identical growth under two environmen¬ 
tal states, y and y', i.e., h{x,y) = h{x,y') for all x, the 
information in the signal to distinguish y and y' has no 
contribution to fitness gain whereas increases. The 
information relevant for fitness can be evaluated more 


tightly by the following FR as 




Km 


(18) 


where := P^’^[%]/PbT he KPB-type 

FR shows that the multivariate mutual information, 
■= X^'^ — is the tighter bound for fitness 

gain by sensing as 


4- 




= (V[F- 


M.X\ 


( 19 ) 


p" 


p^ . In addition, the equality can be attained when 
the backward path probabilities of the optimal switching 
with and without sensing are identical as P“'" = P-. Be¬ 
cause is the residual information of the signal on 

the environment when we already know the phenotype 
path (Fig. 1^ (B)), Xp^'^ is the maximum information 
of the signal that can be imprinted into the phenotypic 
dynamics by selection, i.e., the information of the signal 
consumed and used in selection. 

In this work, we derived various FRs for fitness loss 
and gain with and without sensing the environment. 
These results generalize the previous results obtained by 
Kelly 0, Hacco and Iwasa|^, and others for the average 
of fitness gain and loss. In addition, by combining the 
FRs, we can also recover the result on the fitness gain by 
the optimal causal strategy derived in [Q( 


see 


27). The 


keystone for generalization was the introduction of path- 
wise formulation and the retrospective view of pheno¬ 
typic dynamics via the backward path probability. This 
also enables us to clarify an information-theoretic aspect 
of selection as passive compression of environmental dy¬ 
namics onto the retrospective phenotypic one. Active 
information processing by sensing interacts with this pas¬ 
sive processing, and thereby, the maximum gain of fitness 
by sensing is Quantified by the multivariate mutual infor¬ 
mation (eq. ([1^). Because of the shared mathematical 
structures, this work will be the basis for the integration 
of the information thermodynamics and evolutionary dy¬ 
namics to unveil the interdependencies among fitness, in¬ 
formation and entropy production|l6[j^. 
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